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SUBSTITUTIONS 
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Abstract. We study the diagonals of two-dimensional tilings generated by direct product 
substitutions. The properties of these diagonals are primarily determined by the eigenvalues 
of the substitution matrix, but also the order of the letters in the substitution plays a role. 
We show that the diagonals may fail to be uniformly recurrent, and that the frequencies of 
letters on the diagonal may not exist. We also highlight the connection with the density of 
coincidences and overlap distributions. 
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1. Introduction 

The two-dimensional tilings generated by direct product substitutions have a very simple 
structure, but we will show that the diagonals of these tilings are not that simple. Throughout 
the paper 9 denotes a substitution (morphism) on {0, 1} that admits two fixed points: we will 
require that the first letter of 9(0) is and the first letter of 9(1) is 1. 

The (direct) product substitution 9x9 of 9 with itself is defined by 

6xO(iJ) = 0(%)x6(j), i,je{0,l}, 

where for two words v and w of length m and n we define the product word as the matrix 

v X w = ((Vk.Wp)), , . 

Kl K.I) \<k<m,l<l<n 

For example, when 9 is the Thue-Morse substitution given by 9(0) = 01, and 9(1) = 10 then. 
It is visually more convenient to represent this with lxl colored tiles: 



Figure 1. The Thue Morse direct product substitution, and an order 3 iterate. 

Let u* = #°°(0) and v* = 9°°(1) be the two fixed points of 9. Obviously the 2D-substitution 
9 x9 has four fixed points, u* x u* , v* x v* , u* x v* , and v* xu*. The first two 2D infinite words 
have trivial diagonals: these are just a doubling of u* , respectively v* . However, even though 
the diagonal of u* x v* is also generated by a substitution of length q when 9 is a substitution 
of constant length q, this diagonal is not always so regular as in the u* x u* case. 

From now on, we will often denote the letters of the alphabet {0, 1} x {0, 1} as column vectors, 
for easier reading. It is easy to sec that the diagonal A* = A*(0, 1) of the u* x v* fixed point of 
the Thue-Morse direct product substitution is generated by the substitution 

(?Mi)G) ^ (JMJ)(i> 
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We see that the successive first coordinates of the diagonal will form u* and the successive 
second coordinates of the diagonal will form v* . This will always be the case as soon as 9 is 
continuous, which means that 9(1) = 9(0), where we use the mirror substitution given by = 1 
and 1 = 0. When 9 is not continuous, it is called discrete. The terminology follows [3J. As an 
example of a discrete substitution take 

6»(0) = 011 6»(1) = 101. 

The substitution that generates the diagonal is then given by 

(8) " (8) (!)(!)• (!) " (i)(o)(i). (J) - (J) (?)(!). (!) - (D (§)(!)■ 

It follows that for discrete substitutions the diagonal is less regular: the sequence A* is still 
recurrent, but no longer uniformly recurrent. This is caused by the occurrence of longer and 
longer words consisting solely of the letters (§) and (\). This also leads to the property that (J) 
and (q) have frequency and (§) and (}) have a frequency equal to the original frequencies of 
and 1 in the fixed point u* of 9 (cf. Lemma H). The technical explanation of this phenomenon is 
that for discrete substitutions the substitution that generates the diagonal is no longer primitive, 
but reducible. 

In Section [2] we consider the diagonal problem for substitutions of non-constant length. It 
turns out that in some cases a description of the diagonal is possible with the aid of the notion 
of balanced blocks, which has been used in the literature to compute coincidence densities. In 
Section [3] we extend the notion of balanced blocks, so that it is applicable to a larger class of 
non-constant substitutions. In Section[4]we show how to compute coincidence densities, and give 
an example where the coincidence density (and so the frequencies of letters) does not exist. In 
Section[5]we present a more positive result: using the same substitution to generate a self-similar 
tiling, the frequencies of letters on the diagonal do exist. In fact this is shown to be generally true 
for substitutions with rational eigenvalues. In Section [6] we discuss the case of two eigenvalues 
with modulus larger than 1. In Section [7] we discuss the connection with the notion of overlap 
distribution from statistical physics. 

2. Balanced blocks and the density of coincidences 

Non-constant length substitutions can also generate tilings ([H]). The usual example is the 
Fibonacci substitution n- 01, 1 4 0. But since this substitution does not have two fixed 
points, we will in the following rather consider its nephew given by 

9: 001, lM-10. 

If we want to generate the diagonal A* of the corresponding u* x v* we now have the problem 
that the occurrences of 9(0) and 0(1) in u* and v* are no longer synchronized. The solution to 
this problem stems from |llj , and is known as the Method of balanced blocks. 

Let Ni(A) denote the number of times letter i occurs in the word A. A 'double' word ( 4 ) is 
called a balanced block if 

N (A) = N (B) and N^A) = Nx(B). 
The idea is to cut up the 'double' sequence A* = ( ) into irreducible balanced blocks: 



u = 


001 





01 


10 








1 


001 


10 


V = 


100 





10 


01 








1 


100 


01 



The key observation is that for any balanced block (g) the block y g ^j is obviously again 
balanced. This makes it possible to define a substitution 9 on blocks A — ( g ) by 



6(A) = A 1 ...A 
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FIGURE 2. Direct product of 6 ; 0^010, 1^11010, and second iterate of (*) . 



where Ai . . . A m is the decomposition of A into irreducible balanced blocks. 
For example, taking the first balanced block in the decomposition above: 



001 




001 









01 




10 


100 




100 









10 




01 



Continuing in this way, and identifying the blocks ( ^ ) and ( \ ) , one obtains 9 defined on the 
alphabet of (identified) irreducible balanced blocks 



001 




01 







111 


100 




10 







' 1 



As in [TT] (see also Lemma [4j it then follows that A* behaves as in the case of discrete substi- 
tutions of constant length: the symbols := (g) and I := (\) dominate. 
Actually [llj focusses on the so called density of coincidences, defined by 

1„ 



Sr : = 



lim — Card{l < k < n : it, 

n— >oo TL 



Since counting coincidences is equivalent to counting occurrences of and 1, the result is that 
S c = 1 for this substitution. This plays an important role in the determination of the spectra of 
dynamical systems generated by the substitutions, see e.g., [11], and [7]. Crucial there is that 9 
is a substitution of Pisot type, i.e., the second eigenvalue of Mg lies in the unit circle. In pQ it 
is shown that this implies that the number of irreducible balanced blocks is finite. 

We return to our original question: what is the structure of the diagonal of a direct product 
substitution? As an example of a non-Pisot we consider a substitution of Salem type, i.e., the 
second eigenvalue lies on the unit circle. We take 

9: 0^010, 1 H- 11010. 

Here the 'double' fixed point sequence A* = ( "» ) starts with the irreducible balanced block 



01011 
11010 



Then 9 ( 



01011 
11010 



) contains the irreducible balanced block 



01001011 
11010010 



and the 



0-image of this block contains a irreducible balanced block of length 27. One can show that 
arbitrary long irreducible balanced blocks will occur. In particular we do have a problem to 
compute 8 C for this 9. 



3. Balancing with weights 

To tackle the problem posed at the end of the previous section we generalize the notion of 
balanced blocks. 

A weight vector w T = (wo,wi) consists of two positive reals wq,wi. A block A = (4) 
consisting of two words A and B is called a w -balanced block if 

N (A)w + Ni(A)wi = N (B)w + N 1 (B)w ll 

which we can shortly write as N(A) T w = N(B) T w. 



4 



F. MICHEL DEKKING 



Observe that this generalizes the notion of balancedness in the sense that A balanced implies 
A is unbalanced, irrespective of the choice of w. Throughout the paper the eigenvalues of Mg 
will be denoted by A° and A, where A is the eigenvalue with the largest modulus, also called PF 
eigenvalue. 

Lemma 1. Let 9 be a primitive substitution with matrix Mg. Let w be a Perron Frobenius right 
eigenvector of Mg: Mgw — Au. Then A w-balanced <=> 0(A) w-balanced. 

Proof: Recall that Mg(i,j) equals the number of times the letter j occurs in 9(i). Thus 

N{8(A)) T w = N(A) T M e w = \ N(A) T w = A N(B) T w = N(B) T Mg w = N(9(B)) T w. □ 

It will be useful for us to have an explicit expression for the Perron Frobenius right eigenvector. 
We denote by \w\ the length of a word w. 

Lemma 2. Let 9 be a primitive substitution with matrix Mg. Let A°, A with |A°| < A be the 
eigenvalues of Mg. Then w T — (\8(0)\ — A°, |0(1)| — A°) is a right eigenvector for A. 

Proof: By Cayley Hamilton we obtain (Mg - A Id)(Mg - \° Id) ({) = (g) . □ 

We continue with the substitution 9 : H- 010 1 i-^- 11010; here Mg = (\\). A right 



eigenvector u is u 
unbalanced block 



(1,2). So the 'double' sequence A* 



C = 



010 

lif 



directly followed by 



D = 



starts with the irreducible 



to 10 



By Lemmajl] we can decompose the #-image of these two blocks in irreducible u-balanced blocks: 



9(C) = 






1 


0J 


Jl 1 


1 OJJOJJIJJ 


1 


1 


(0 


1 


1 lffOfjTpf 



9(D) = 



1 


1 


0J 


Jl 


1 





1 








1 1 


(0 


1 








1 






■■Or, 



Note the occurrence of the 'skewed' u-balanced blocks 0^ —: PI and 1^ =: 
We see that A* is generated by the substitution given by: 

9(6) = 010, 0(1) = 11010, 9(6^) = ^^% 9(1^ = ^^%^% 

9(c) = cdco^\% 9(b) = be £> 6 i o. 

However,... we tacitly assumed that the blocks keep their 'shape'! Actually if A = ax . 
and B = bi . . .b n , then a u-balanced block has a shape as 

Joi a m [ 

[ b\ . . . b s b s+ i . . . b s+m b s+m+ i . . . b n -ib n ] ■ 

So a u-balanced block has two more parameters, a left slope s and a right slope m — n + s. 
By definition the left slope of 9(A) is equal to the left slope of A. 

Lemma 3. Let 8 be a primitive substitution of Salem type. Let A = ( g ) be a w-balanced block. 
Then the right slope of 8 '((b)) * s eaua ^ t° the right slope of (g). 

Proof: Suppose that A° = 1, and let s be the left slope of A. We have to show that — 
\9(B)\ + s= \A\ - \B\ + s, which holds if and only if \9(A)\ - \A\ = \9(B)\ - \B\. The left hand 
side equals 

\9(A)\ - |A| = N Q (A)\9(0)\ + JVi(A)l©(l)| - N (A) - N,(A) 

= iVo(A)(|0(O)| - 1) + Ni(A)(\0(l)\ - 1) = N(A) T w, 

where w is a Perron Frobenius right eigenvector of Mg (using Lemma [2]). Since (g) is balanced 
A^) T u = N(B) T w, and it follows that \9(A)\ - \A\ = \9(B)\ - \B\. 

In case A° = —1, one replaces 9 by 9 2 , and proceeds as above. □ 
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4. Computing the coincidence density 

The following general lemma will be useful. 

Lemma 4. Let (a n ) be a sequence of real numbers such that lim„_>. 00 — X)"=i a « = a ex ^ s l s - Let 
(rife) and (tk) be two sequences with tk > 0, rife — >■ oo and lim sup rife /ife < oo. Then 

lim — aj = a. 

k— yoo t k ^ — 4 

i=n h +l 

Proof: Let e > 0. For all large n we will have n(a — e) < Yh=i a i — n ( a + e )- So 

rtk+tk "fe 

(n k + t k )(a - e) - n k (a + e) < ^ at ~^ai < (n k + t k )(a + e) - n k (a - e), 

1=1 i=l 

which is equivalent to 

nk+tk 

t k a - e(t k + 2n k ) < 2J ai < t k a - e(t k + 2n k ). 

i=n k +l 

Dividing through by t k , and letting A; — > oo, the result follows by next letting e | 0. □ 



Our main result is 

Theorem 1. Let 9 be a primitive substitution on {0, 1}* of Salem type with 9(0) = . . . , and 
9(1) = 1 . . . . Let w be a Perron- Frobenius right eigenvector of Mg. Suppose the set of w-balanced 
blocks is finite, and contains both and 0\. Then the coincidence density of 9 does not exist. 

Proof: Suppose that 5 C does exist. Let jo be a position in the double sequence x where does 
occur. Let A = X\ . . . Xj -i- It follows from Lemma Mtb&t for all k > 1 the block 9 k (6) does 
occur in x at position n k := |6* fc (A)|. Let t k :— |6* fc (0)|. From the Perron- Frobenius theorem 
one deduces that lim sup rife /i/c < oo. We can therefore apply Lemma |4j and since we have t k 
coincidences in the block 9 k (0), it follows that S c = 1. Similarly, let j\ be a position where 
occurs, and take n k (with j replaced by ji) as before. Ignoring the lower left and the upper 
right letter of 9 k (0\ i ), take t k — \9 k (0)\ — 2. It remains to determine the number of coincidences in 
the block 9 k (0\ i ) (with the two letters removed). For any k this block contains a positive fraction 
of blocks 9(0\), so not to have a contradiction with 5 C = 1, no non-coincidences are allowed for 
9(0^). This requirement leads to 9(0) = ... 0, which contradicts primitivity of 9. Conclusion: 
5 C does not exist. □ 
Applying this result to the substitution 9 : i — ^ 010, In- 11010 from the previous section, 
we find that the coincidence density does not exists here, nor will the frequencies of letters on 
the diagonal A*(0, 1) exist. 

5. SELF-SIMILARITY CAN INCREASE REGULARITY 

As in 13J one obtains a self similar ID tiling from a substitution 9 by associating to each letter 
i an interval of length Wi, where w is a right Perron- Frobenius eigenvector of the substitution 
matrix. This induces a 2D self-similar tiling from the direct product 9 x9 , by associating to each 
pair of letters (i,j) the Wi x wj rectangle. Let us reconsider our example 9 : H 010, 1 H 
11010, choosing w T — (1,2) T . The diagonal of the 2D self-similar tiling associated to this 
example can be generated by a substitution S on eight letters, corresponding to the ways in 
which the diagonal intersects a tile: a for the lxl tile, 6 and c for the 1x2 tile, etc., see 
Figure [3] The substitution 5 is given by 

(5(a) = S(c) — 8(e) = aha, 8(b) = 8(f) = bgfc, 8(d) = 8(g) — dfge, 8(h) = hhaha. 

We can split the word 8(h) into the words h h and aha, and the letter h into two letters h = hh, 
obtaining a new substitution 8 on nine letters generating the diagonal, given by 

8(a) = S(c)= 8(e) = 8(h) = ahha, 8(b) = 8(f) = bgfc, 8(d) = 8(g) = d / g e, 8(h) = hhh h. 
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abed e / g h 



Figure 3. Four tiles, 8 diagonal tiles, and the first iterate of the &-tile and the h-ti\e. 

Now note that S is a substitution of constant length 4, and discrete. This implies that the 
frequency of letters in A*(0, 1) = A* (b) exists (the letters a and h will have frequencies 2/3 and 
1/3, the others frequency 0). This phenomenon is a special case of the following general result. 

Proposition 1. Let 9 be a primitive substitution on {0, 1}* with 9(0) = . . . , 9(1) = 1 . . . , for 
which Mg has rational eigenvalues. Then the frequencies of letters on the diagonal of the 2D 
self-similar tiling generated by 9 x9 do exist. 

Proof: Let A be the PF eigenvalue of 9. Since sum and product of the eigenvalues are integers, 
it follows that A is an integer, and thus there is an integer eigenvector, say (p, q) T , with p, q > 0. 
Ignoring a scaling, we can assume that the ID self-similar tiling generated by 9 is built out of 
two tiles with length p, respectively q. The strategy is to cut up these two tiles in intervals of 
length 1, so that we may describe the diagonal of the direct product by the corresponding lxl 
squares (cf. the example above). Let a^, h = 1, . . . ,p and bi, I = 1, . . . , q be the letters of an 
alphabet A of size p + q. Define the map 7 : {0, 1}* — > A* by 

7(0) = axa-2 . . .a p) 7(1) = &i& 2 • • • b q . 

Next, define the substitution 9 : A* — >• A* by 

9(a k ) = 7(0(O))P - 1)A + 1, kX], k = 1, . . .p, 9(h) = j(6(l))[(l - 1)A + 1, £\], £=l,...,q. 

Here we use the notation w[m, n] :— w m . . .w n , and we remark that such a definition makes 
sense because of the eigenvalue equation Mg (q) = \(q). Now note that the diagonal of the 
direct product self-similar tiling is generated by 9 x 9, where 9 is a constant length substitution. 
As remarked in Section [l] this implies that the frequencies of letters exist. □ 

6. All eigenvalues outside the unit circle 

The case where the modulus of both eigenvalues of Mg is larger than 1 is more difficult. It is 
conjectured in |11) that in this case the coincidence density will not exist. 

First we discuss the case where the eigenvalues of Mg are rational. As in the proof of Propo- 
sition [T] this implies that the smallest eigenvalue A° is an integer. Let tobea right PF vector of 
Mg. The set of w-balanced blocks will be either empty or infinite. This follows from an extension 
of Lemma |3| the left and the right slope of w-balanced blocks are multiplied by A° when passing 
from A to 9(A). Thus the technique of to-balanced blocks is no longer useful. However, for the 
self-similar 2D tilings generated by 9 Proposition [T] applies. 

Secondly we consider irrational eigenvalues. With Lemma[2]one deduces from this irrationality 
that the only w-balanced blocks (with the PF weight vector w) are the balanced blocks. Using 
the condition |A°| > 1, one can show that the set of balanced blocks is either empty or infinite. 

As an example of a substitution with irrational eigenvalues we consider 

9 : i-> 00001, 1 ^1110, 
which has eigenvalues | — and \ + \ 
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Figure 4. The first three approximants to the two curves K(0) and i^(l)] 
generated by the full representation of the substitution i— > 1, 1 i — ^ 1110. 

Proposition 2. The set of balanced blocks of the substitution 8 given &j/ i — > 1, 1 i — ^ 1110 
is empty. 

Proof: We give a geometrical proof. We consider a geometric realization of 6 given by its 
full representation, cf. 4. This means that we consider the morphism / : {0,1}* — > M 2 given 
by /(0) = (1,0) T , /(l) = (0,1) T , and the linear map Lg with matrix Mg. Then we have the 
representation equation 

/ o 9 = Lg o /. 

From [3] we have that curves 

K n (i) :=L- n K[6 n (i)} 

converge exponentially fast to a curve K(i) for i = 0, 1. Here for a word w = w\ . . . w m we denote 
by K[w] the polygonal curve passing successively through the points (0, 0), f(wi), f(wiw 2 ), ■ ■ ■ , 
f(wi . ..w m ). 

In Figure [4] we display K n (0) and K n {l) for n = 0, 1,2. From the exponential convergence 
(speed 1/11), and the fact that these curves (almost) live outside the positive quadrant, it is 
obvious that they will never intersect, and thus there is no balanced block. □ 

The self similar 2D-tiling associated to the substitution i-> 1, 1 1-> 1 1 1 is illustrated 
with Figure l5l The scalings are given by the eigenvector {\ + \^fh, 1). We conjecture that 
there are infinitely many 'types' of squares and rectangles on the diagonal. At least it is clear 
that there are no 'regeneration points' as in the Pisot case, because of Proposition [2j the only 
{\ + ^\/5, l)-balanced blocks are the balanced blocks, and these do not exist. 




Figure 5. Part of the self-similar product tiling generated by the substitution 
1 v 1 , 1 H- 1110. 
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7. Overlap distributions 

The coincidence density S c can be considered as a special case of the so called overlap between 
two sequences x and y defined by 

S c (x, y) := lim — Card{l < k < n : Xk — yk}- 

n— >oo ft 

Suppose x and y come from a shift invariant set X of sequences, equipped with a shift invariant 
measure /i. Then the distribution of S c (-,-) under the measure fx x fj, is called the overlap 
distribution, introduced by Parisi (|10jV It was shown in [5] that if fj, is weakly mixing, then the 
overlap distribution degenerates to a point mass concentrated on /^([0]) 2 + • • • + [i({r]) 2 , where r 
is the number of letters in the alphabet, and [i] is the cylinder {x 6 X : xq = i}. (This follows 
immediately from the ergodic theorem applied with [i X fi.) 

Primitive substitutions 8 generate a shift dynamical system with a unique invariant measure 
/is. The overlap distribution of some of these is determined in [5] (see also [6]). For the substi- 
tution 8 : On-010 1 h-> 1 10 1 studied in Section [3j it is known that fig is weak mixing ([5], 
[5]). So for this substitution 5 c (x, y) exists for /xg-almost all x and y, and for those x and y 

s c (x,y)^M[o}) 2 + MW) 2 = l + l = \- 

By Theorem [I] this means that the pair (u*,v*) is non-generic for the overlap distribution. In 
fact most substitutions with matrix ( \ \ ) and two fixed points will have non-generic pairs (u* ,v*) 
by an application of this theorem. One exception is the substitution given by 8(0) — 010, 0(1) = 
10101, since it generates a periodic sequence. Another is the 'Kakutani' substitution (cf. [3 ) 
given by 

0(O)=OO1, 0(1) =11001. 
Here one can show easily that S c (u* , v*) = S c = |, so in this case the pair (u* , v*) is generic. 

8. Final remarks 

We would like to point out that the phenomenon of non-existence of frequencies of symbols 
can already occur for substitutions of constant length. A simple example is given by 

1h^123, 2^ 222, 3 ^ 333. 

Of the 12 substitutions with matrix (23) (and 8(0) — 0. . . , 8(1) = 1 . . . ) there are 9 for 
which the coincidence density does not exist. Then there is a periodic one, and the 'Kakutani' 
substitution mentioned in the previous section. There is one substitution left, given by 8(0) = 
001, 8(1) = 10110. We do not know how to determine the coincidence density or frequencies 
of letters on the diagonal for this substitution. Nor do we know how to compute these for 
substitutions whose matrix has two eigenvalues outside the unit circle. 
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